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Abstract. In this paper the Bayesian analysis is applied to assign a probability 
density to the value of a quantity having a definite sign. This analysis is logically 
consistent with the results, positive or negative, of repeated measurements. 
Results are used to estimate the atom density shift in a caesium fountain clock. 
The comparison with the classical statistical analysis is also reported and the 
advantages of the Bayesian approach for the realization of the time unit are 
discussed. 
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1. Introduction 

The Bayesian analysis is acquiring increasing importance in metrology; a tutorial guide 
can be found in Sivia's book [1] and a comprehensive and authoritative account is the 
treatise by Jaynes |2]. The present paper illustrates the use of the Bayes theorem to 
infer the coUisional coefficient of atom-fountain clocks - from theory, a negative one - 
from the results of a sequence of measurements. The paper illustrates in a very simple 
way the differences between the orthodox and the Bayesian analyses, which differences 
are usually unnoticed. 

When the data are used to estimate the measurand, the orthodox analysis 
concerns the probability density of the estimate [3]. Instead, the Bayesian approach 
concerns the probability density of the measurand, given the single data set generated 
by measurement and prior possible information. If the data are unbiased normal 
variables, in the absence of prior information, the same multivariate Gaussian function 
describing the estimate distribution is also the probability density of the measurand. 
Since this function is invariant when the estimate is exchanged for the measurand, the 
orthodox and the Bayesian inferences, though conceptually different, are numerically 
the same. This explains the limited awareness of the risk of using the solution of one 
problem as the solution of the other. 

Had the data been sampled from a distribution not having the above symmetry, 
the difference between the probability densities of the estimate and of the measurand 
cannot be unnoticed. An interesting case is that of a positive measurement result 
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Figure 1. A sample from a Gaussian distribution with zero mean and unit 
variance has a 50% probability to be positive. If it lies in the 16% distribution 
tail (filled in yellow), the classical confidence interval is in the positive region. 
Examples of measurement results with an associated uncertainty entirely in the 
positive region are also shown. 

when the measurand is negative. Though the measurement resuh is an unbiased 
estimate, this inference looks quite strange. As exemphfied in Fig.[Tl if the measurand 
approaches zero when compared to the measurement uncertainty, there is nothing 
unusual about obtaining a positive value. Assuming a Gaussian sampling distribution, 
there is almost a 50% probability that the measurement result is positive and about 
a 16% probability that the 68% classical confidence interval (the result plus/minus 
the uncertainty) lies in the non-physical region. Though the confidence interval is 
instrumental only in the assessment of the estimate and not of the measurand [4] , this 
view is not shared by most metrologists. They figure out a Gaussian function centred 
on the estimated value and discuss whether the measurand lies within the confidence 
interval. In the above situation, the consequent paradoxical confidence interval in 
the non-physical region is explained by observing that, according to a seminal 1937 
paper by Neyman [4], the confidence interval is so defined as to keep strictly to a 
statement about the probability density of the measurement result without considering 
the probability density of the measurand. According this statement, independently of 
the measurand value, only 68% of the intervals obtained from a set of measurements 
will contain the measurand. 

2. Repeated measurements of a positive quantity 
2.1. statement of the problem 

Let us suppose that the measurement of a quantity y, known to be negative, yields 
value yi . The measurement result ym is assumed normally distributed about the value 
yo of the unknown quantity, with a known standard deviation ai. The probability 
density of obtaining the measurement value yi is 

Py^{y„i ^ yi\y = yo) = —= — exp . (1) 

The question is how to account for j/o < and to infer a consistent value for the 
measurand. 
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2.2. Bayesian solution 

In the absence of any additional information, the Bayesian considers the < 
constraint in the probabihty density of the measurand values before the measurement 
result is know, 

Py{y = yo)^^{~yo), (2) 

where '&{x) is the heaviside function. According to the Bayes theorem, the post-data 
probability density of the possible y values, given the measurement result ym = yi, is 
the tail of Py^^ (j/,„ = yi\y = yo) lying in the yo < interval, 

Py^{ym = yi\y = yo)Py{y = yo) 



Py{y = yo\y,n = yi) „ , . 

PyAym. = yi) 

'^exp[-(yo-yi)V2]^(-2/o) 



^/2^erfc(2/l/^/2) 

where we set di = 1 since this requires only a trivial change of variables. The 
probability density Q embeds both the information available independently of the 
measurement result, that is, yo < 0, and the information delivered by the yi datum. 

2.3. Bayesian inferences 

To convert the measurand post-data probability density ^ into a single numerical 
estimate, y, the loss, £(y — y), associated with the estimate error must be specified. 
The £(y — y) function maps the error onto a number representing the cost associated 
with a wrong estimate. The optimal estimator minimizes the expected loss over the 
measurand distribution, that is, y = arg^^ minE[£(y — y)] , where 

/ + 00 
£(y-yo)P^(yo|yi)dyo. (4) 
OO 

The squared-error, that is, £(y — y) = {y — y)^, mdicates the mean [S], 

v/27^exp(-y^/2) 

erfc(y,/V2) ' ^'"^ 

the absolute-difference, that is, £(y — y) = |y ~ indicates the median, which is 
implicity given by 

- 2erf [(y - yi)/V2] = 1 + erf(yi/x/2), (56) 

and £(y — y) = const, indicates the mode. 

r yi ifyi <0 

^~ 1 ifyi>0 ■ ^^^^ 

Confidence intervals are easily expressed by integrating ([3]) to obtain the relevant 
cumulative distribution function. 

The expected value and the median of the measurand are shown in Figs. [5] and [3] 
together with the standard deviation and quartiles. When the measurement result is 
negative and large, both the expected value and the median coincide with the measured 
value. In this case, both the orthodox and Bayesian analyses deliver the same estimate. 
In fact, Py{y = yo\ym ~ yi) is identical with Py^{ym = yi\y ~ yo) for all practical 
purposes and the symmetry Py^{ym = yi\y = yo) = Pyn^iym = yo\y = yi) allows 
the two probability densities, though conceptually different, to be identified. When 
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Figure 2. Expected value of a negative quantity given a measurement value 
sampled from an unbiased Gaussian distribution with unit variance. Lower and 
upper lines indicate the plus/minus one standard deviation interval. The dashed 
line is the orthodox estimate of the measurand. 
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Figure 3. Median of a negative quantity given a measurement value sampled 
from an unbiased Gaussian distribution with unit variance. Lower and upper 
lines are the first and third quartile. The dashed line is the orthodox estimate of 
the measurand. 



the measurement result approaches zero, or even is positive, the difFerenee between 
the orthodox and the Bayesian analyses is evident. The measurement result is now 
a very poor estimate of the measurand value. On the contrary, as shown in Figs. [2] 
and [3l the Bayesian approach, based on the post-data probability density, does not 
have any difficulty. The smaller (in absolute terms) the measurement result is, the 
nearer to zero, but negative, is the estimate; it is to be noted that, in this case, also 
the uncertainty of the estimate decreases. 



2.4- extension to repeated measurements 

The previous analysis can be extended to N repeated independent measurements. In 
this case, the post-data probability density of the y values is 

Pyiy = yo) I\f=i Py^ {vm = yi\y = yo) 



Py{y = yo\ym = {yi,2/2, ■•■ yw}) 



Pym.{ym = {?/i,y2, •■■ yjv}) 
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_ 2exp[-(yo-y)V2]^(-;;o) 
[1 + erf(y/V2)] ' 

where y is the weighed sample mean and the y variance, = ( ^ l/o"?) l^^s been 
set to one. To compress © into a single numerical estimate of y, we can still use 
(|5al-c). where the sample mean y substitutes for yi. 

In another way to process data, we observe that y is an estimate of y normally 
distributed about yo. If the data are numerous, y is expected to be negative. Therefore, 
it might seen to be an optimal estimate. However, from a Bayesian viewpoint, given y 
and 2/0 > 0, the starting points to calculate the probability density of the measurand 
values are again ^ and , where y and <7y substitute for yi and ai . The probability 
density of the measurand is identical to (l6|), as expected. Hence, if the cost function 
is quadratic, the optimal estimate is the expected value of the measurand over the 
probability distribution of its possible values, given by ()5a|) . But this expected value is 
greater than y, as shown in Fig. [2l To explain why the Bayesian estimate (|5a|) differs 
from the orthodox, we must observe that now, owing to the prior information, the 
probability densities of the estimate and of the measurand are not the same function. 

3. Evaluation of the collisional coefRcient of a Cs fountain 

An interesting application of previous results is offered by primary frequency 
metrology. Ultracold Cs fountains realize the time unit, the second, with a relative 
uncertainty of few parts in 10~^^. This achievement has been made possible by the 
use of atomic Cs samples cooled down to about 1 fiK. A detailed description of this 
frequency standard can be found in [6] and references therein. Caesium fountains 
implement a Ramsey spectroscopy scheme, where a sample of cold atoms is launched 
in a ballistic flight. During the flight, the atoms go upwards and then come back 
downwards passing twice through a microwave cavity and interacting with a microwave 
radiation at 9 192 631 770 Hz. Then, an optical detector counts how many atoms have 
undergone the hyperfine transition induced by the microwave radiation. The transition 
probability is then calculated, thus obtaining the frequency difference between the Cs 
reference and the interrogating microwave. The SI second, the definition of which dates 
from the 13th Conference Generale des Poids et Mesures in 1967, is the duration of 9 
192 631 770 periods of the radiation corresponding to the transition between the two 
hyperfine levels of the ground state of the Cesium 133 atom. Therefore, the realization 
of the second should rely on a single and unperturbed atom, while atomic fountains 
use samples of about 10^ — 10® atoms. 

In these samples, collisions occur among the ultra-cold atoms with consequent 
perturbation of the energy levels and shift of the atomic reference frequency. This 
shift, called collisional or atom density shift, is proportional to the atom density and it 
must be carefully evaluated in order to correct the frequency standard. Its uncertainty 
is one of the main contributions to the accuracy budget of an atomic fountain. As 
collisions play a crucial role in high precision clocks, they have been the subject of 
many theoretical and experimental studies [7j. 

It has been demonstrated theoretically and experimentally [8| that, if a fountain 
is operated in a way to make the collision energy high during the whole ballistic flight, 
the collisional shift has a weak dependance on the atom temperature. This is the 
case, for example, of direct molasses capture or molasses expansion after magneto- 
optical-trap capture [5]. Moreover, if the atomic sample is a quantum mixture of the 
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Table 1. Clock-frequency sensitivity to the atom density (coUisional coefficient) 
as measured in a Cs fountain accuracy evaluation 



coUisional cocfBcicnt (uncertainties) / arbitrary units 



-2.97(5.19) 


-3.27(3.96) 


-6.41(3.64) 


-1.42(4.85) 


+4.31(3.76) 


-1.88(3.53) 


-5.35(3.91) 


-5.71(3.69) 


-4.04(3.49) 


-7.48(3.66) 


-1.37(3.71) 


-2.41(3.66) 


-3.60(3.93) 


-4.79(4.54) 


-4.56(4.43) 


-1.93(4.56) 


-6.41(3.57) 


-2.23(3.48) 


-4.75(3.72) 


-1.47(3.49) 


-4.56(3.29) 


+2.59(3.71) 


-2.76(3.75) 


+0.47(3.72) 


-0.97(3.98) 


+2.17(3.95) 


-4.75(3.98) 


+0.58(3.82) 


-0.81(3.79) 


-3.64(4.22) 


-5.71(4.61) 


+2.99(4.89) 


+0.97(6.82) 


-2.90(4.82) 


+2.17(4.66) 


-5.53(4.01) 


-0.40(4.22) 


-4.48(3.81) 


+0.10(3.84) 


-3.87(3.93) 


-3.43(4.32) 


-5.13(4.33) 


-5.42(4.00) 


-3.19(4.22) 


-1.35(4.40) 


-0.80(4.32) 


-0.08(4.28) 









hyperfine eigenstates \F = 3, mp = 0) and \F = 4, mp = 0), as occurs in the fountain 
operation, the coUisional shift has also a negligible dependence on the mixture ratio 
and it is linearly dependent on density through a negative coefficient. 

Provided some technical parameters are known and kept stable the density shift 
estimation is possible, but with high uncertainty. An evaluation of the density shift of 
lower uncertainty is possible by a differential measurement technique: the fountain is 
operated alternately at high and at low atom densities. Then, the two measurement 
results are used to extrapolate the frequency to the zero density, with use of a 
linear regression. This differential measurement evaluates the atom density frequency 
sensitivity; we refer to this sensitivity as the coUisional coefficient. This measurement 
technique requires that the parameters linking the output signal to the atom density 
remain stable only for the differential measurement time instead of during the whole 
measurement duration. The evaluation of the density shift must be repeated regularly 
to cope with the possible instabilities. 

Since the leverage of the differential measurement is not particularly high, the 
frequency shift often approaches the clock resolution. Therefore, the measured values 
of the coUisional coefficient have an uncertainty comparable to their magnitude. Table 
1 records measurement results, together with their uncertainties, collected during an 
accuracy evaluation of INRIM fountain IT-CsFl. The coUisional coefficient value 
expected from theory is negative and the overall measurement values confirm this 
assertion. Nevertheless, some of the values are positive, in agreement with a Gaussian 
dispersion. These data are suitable for Bayesian analysis; in fact, when using an 
orthodox approach, we are not able to take into account that we know that the 
coUisional coefficient is negative. 

The post-data probability density of the measurand is shown in Fig. [4l owing to 
the relatively sharp peak, the Bayesian cut of the positive tail of the distribution has 
no practical effect. Both the weighed mean of the sample and the Bayesian expected 
value of the measurand are —2.58 ± 0.58. We can also infer the sensitivity value 
from the 4.31 ± 3.76 datum, the confidence interval of which is entirely in the non- 
physical region. The result is —lAbfl'^Q (median plus/minus quartiles). Despite the 
unsatisfactory datum, the Bayesian inference is really good, when compared to the 
much more accurate value inferred from many measurement repetitions. In general, 
when the datum is negative the Bayesian and orthodox estimates tend to coincide. 
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Figure 4. Post-data probability density of the coUisional coefficient, given 
tfie measurement values and uncertainty in Table 1 (Gaussian peak) and the 
measurement values 4.31 ± 3.76 (exponential line) and —2.90 ± 4.82 (broad 
Gaussian curve). 

The worst case is when the measurement result is negative, as expected, but has large 
uncertainty; for example, if the — 2.90±4.82 datum is considered, the Bayesian median 
plus/minus quartiles is — 4.59^3 40. 

4. Conclusions 

The Bayes theorem accounts for prior information at the very beginning of data 
analysis, in a way ensuring the logical consistency of statistical inferences. In the 
specific example here considered, the theorem demonstrates capable to cope with a 
problem the orthodox solution of which is unsatisfactory. 

As regards primary frequency metrology, the results we obtained imply two main 
consequences. Firstly, by use of the sign information, the density shift of a Cs-fountain 
primary standard can be estimated within a given uncertainty from a smaller data 
set. Hence, the number of collisional coefficient measurements can be reduced and 
the density shift evaluation can be made faster, without increasing the uncertainty. 
Besides, the reduction of the evaluation time is a desirable feature of a Cs fountain. 
Secondly, each measured density coefficient is immediately used to extrapolate the 
fountain frequency to zero density. Therefore, a proper use of the sign information to 
extrapolate a single measurement pair to zero density will improve the measurement 
capabilities and the detection of failures and anomalies. This will be the subject 
matter of future work. 
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